Force induced unfolding of bio-polymers in a cellular environment: A model study 
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Effect of molecular crowding and confinement experienced by protein in the cell during unfolding has been 
studied by modeling a linear polymer chain on a percolation cluster. It is known that internal structure of the 
cell changes in time, however, they do not change significantly from their initial structure. In order to model this 
we introduce the correlation among the different disorder realizations. It was shown that the force-extension 
behavior for correlated disorder in both constant force ensemble (CFE) and constant distance ensemble (CDE) 
is significantly different than the one obtained in absence of molecular crowding. 
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I. INTRODUCTION 

Understanding of the structure and function of bio- 
polymers in vivo by analyzing it in vitro is one approach 
HI El HI 0], but another route is to perform the analysis in 
presence of the environment similar to in vivo Jll|^,|3,[a]. It is 
now known that the interior of the cell contains different kind 
of biomolecules like sugar, nucleic acids, lipids etc. These 
macromolecules occupy about 40% of the total volume with 
steric repulsion among themselves. This confined environ- 
ment induces phenomena like "molecular confinement" and 
"molecular crowding" and has major thermodynamic and ki- 
netic consequences on the cellular processes j^ UOllllll . In re- 
cent years, Single Molecule Force Spectroscopy (SMFS) ex- 
periments were mainly performed in vitro (absence of molec- 
ular crowding) to understand the cellular processes lfll [T2l[T3l 
[Till . Theoretical modeling of such processes with simplified 
interactions 

EH EDS SIM EE III EES amenable 
to statistical mechanics have been used extensively to com- 
pare the outcomes of these experiments. Such models despite 
their simplicity, have been proved to be quite predictive and 
provided many important information about the cellular pro- 
cesses. In all such modeling, the confinement imposed by 
the cellular environment has been ignored and surrounding 
environment has been considered as homogeneous. For ex- 
ample in protein unfolding each amino acid (monomer) inter- 
acts with its fixed number (depen ding on the lattice) of nearest 
neighbors M M EH M, MMl- 

In vivo, cellular environment and interactions involved in 
the stability of protein remain no more homogeneous. Hence 
in theoretical modeling, it is essential to use the underlying 
lattice to be disorder (or random) in order to study the effects 
of the cellular environment. If different biomolecules do not 
move with time the effect of such confinement is termed as 
"molecular confinement". If they move, then effect is termed 
as "molecular crowding". From the statistical mechanics per- 
spective for the "molecular confinement", quenched averaging 
is appropriate while for the "molecular crowding" constrained 
annealed averaging will be suitable. It is pertinent to men- 
tion here that unfolding process is a dynamical phenomenon 
of the order of micro seconds to few seconds. In such a short 
time span, internal structure of cell does not access all pos- 
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FIG. 1 : Schematic representation of a polymer chain on a disordered 
lattice. The filled circle represents the available site and open circle 
represents the occupied sites. The imposed restriction of p c above 
than the value 0.5928 represents the unavailability of fraction of sites 
("volume exclusion" which is about 40% as seen in the cell). One 
end of the polymer chain is kept fixed while a force F has been ap- 
plied at the other end. 



require the averaging over all possible internal conformations 
(— ► oo) of the the cell. Therefore, in present study, we restrict 
ourselves to finite, but a set of large disordered realizations 
which roughly reproduce the effect of the internal structure of 
the cell. 

The aim of this paper is to study the effect of an applied 
force on a polymer trapped in a random environment as shown 
in Fig. 1. The random environment mimics the effect of 
crowding agents as seen in the cell. The confinement of the 
polymer to a restricted portion of phase space leads to the loss 
in entropy. Therefore, effect of the applied force on the re- 
action co-ordinate (in this case end to end distance) is mainly 
determined by the loss of configurational entropy and gain in 
internal energy because of the nearest neighbor interaction (e). 
Because of the surrounding environment the average number 
of interactions per monomer is not constant, but may vary in 
between to 2. This introduces heterogeneity in the inter- 
action along the chain even in case of homopolymers. It is 
important to mention here that in this case the precise infor- 
mation about the reaction co-ordinate, probability distribution 
of the reaction co-ordinate on the parameters of the system is 
difficult to obtain analytically, therefore, one has to resort on 
numerical treatments. 
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FIG. 2: (a) Histogram of Z at F = and T = oo; (b) Corresponding histogram in log-log scale shows the power law behavior; (c & d) Same 
as (a, b) but for F = .75 and T = 0.3. 



introduce the model to mimic the cellular environment and de- 
scribe how we are considering the random correlated disorder 
environment. We also briefly discuss the exact enumeration 
technique in this section. In sec. Ill we present our main re- 
sults obtained in constant force ensemble (CFE) and constant 
distance ensemble (CDE). We show the force-extension be- 
havior in presence of crowding agents significantly different 
than the one obtained for the pure case which represents the 
unfolding process in vitro. We briefly discuss the consequence 
of molecular crowding and confinement in sec. IV. 

II. MODEL AND METHOD 

A. Random Environment: Correlated Disorder 

We model a polymer chain described by the self attract- 
ing self avoiding walks (SASAWs) [16] confined in a random 
environment. Such environment can be obtained by using 
the method developed for the percolation cluster. We choose 
probability for an unoccupied site equal to p w hich is just 
above the percolation threshold p c (= 0.5928) iElErj l^l^. 
This will offer a wide class of the distribution of clusters. 
Cluster size can vary from the isolated site to the length of 
the system. For the collapsed state the ground state energy 
arising due to the nearest neighbor interaction will not be con- 
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state energy is — eN which is a constant. 

Molecular crowding is a dynamical phenomenon involv- 
ing the appearance, disappearance and movement of voids. 
Therefore, the internal structure of the cell changes contin- 
uously. Recently Singh et al [29] have studied the effect of 
molecular crowding by considering the underlying lattice as 
percolation cluster where each realization of disorder was in- 
dependent of the previous disorder realization. In presence of 
molecular crowding agent, Yuan et al. 13011 has performed the 
unfolding experiment for the first time. In order to model the 
cellular environment, they used dextran as crowding agents. 
From this experiment and from the knowledge accumulated 
from the structure of cell, it has been revealed that the inter- 
nal structure of the cell changes over the time, but it does not 
change rapidly. Therefore, they consider the average effect 
of crowding agents on the unfolding. Hence their results do 
not distinguish between molecular confinement and molecu- 
lar crowding. Since, the cell structures at different time stages 
remain correlated with the initial (native) structure, therefore, 
each realization should also be correlated with the initial (na- 
tive) structure. In order to obtain the set of correlated perco- 
lation clusters, we choose a percolation cluster (native) and 
generate other clusters by changing certain fraction (in this 
case 10%) of the occupied site to unoccupied site and vice 
versa with respect to the native. This condition leads to a set 
of correlated percolation clusters, where each cluster is corre- 

_1 j 1 _ __ 1 - .__ _1 * £•£•_ '.I. 
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B. Exact Enumeration Method 
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FIG. 3: Figures show the variation of reaction co-ordinate with force 
at different temperature in CFE: (a) Representative F-x curve for dif- 
ferent disorder realizations at T=0.3. < x' n ' > is for initial (native) 
realization and < x^ 1 ' > is for other realizations which differ from 
native by 10%; (b) F-x curve for approximate annealed average and 
sample averaged quenched disorder at temperature (T=0.3); The na- 
tive (initial realization), approximate annealed case and individual 
quenched disorder show the multi step plateaus which are absent in 
pure case. 



We enumerate all possible walks both on square lattice with 
disorder (percolation lattice) and without disorder (pure lat- 
tice). Since, all the single molecule experiments are of few 
monomers and hence the observed effect may have finite size 
effect. In this context exact enumeration technique has been 
proved to be quite successful in order to have better under- 
standing of single molecule experiments of finite size. As the 
number of conformation increases with /i , we restrict our 
calculation for chain length up to 45 steps walk. Here [i is the 
connectivity constant of the lattice. The partition function of 
the i th realization of the disorder configuration may be written 
as 



E 

(N p ,\x\ 



C^pi \ x \ 



(D 



N (N p , \x\) is the number of distinct conformations of 



Here C 

walks of length N in the i-th realization of the disorder with 
N p non-bonded nearest neighbor pairs and whose end-points 
are a distance x apart. u>(= exp[(3(F.x)]) is the Boltzmann 
weight for the force, where x is the unit vector along the x- 
axis and [3 is defined as pp. Here, k is the Boltzmann constant 
and T is the temperature. u(= exp(— j3e u )) is the Boltzmann 
weight for nearest neighbor interaction. 

In the following, we set e/ k = 1 and focus our discussion 
on force-extension behavior in CFE and CDE. In the present 
study we consider 10000 realizations (N to t) of the correlated 
percolation clusters. In Fig. 2(a), we plot the histogram of 
Z for all 10000 realizations. It is surprising to see that each 
realization which differs from the native conformation by 10% 
only give such a wide range (10 3 to 10 10 ) of the distribution 
of Cjy. The existence of long tail in the distribution indicates 
that the standard deviation will be quite larger than the mean 
value. The log-normal distribution of the above histogram of 
5000 realization shown in inset of Fig. 2a, give mean value w 
18.2 and standard deviation w 3.1 for each set. In Fig. 2(b), we 
plot the frequency of Z with rank in the log-log scale. From 
this plot, we find power law behavior. In Fig. 2(c), we plot 
histogram at intermediate temperature and force (T = 0.3 
and F = 0.75) and found that the qualitative behavior (long 
tail) of the histogram and the power law (Fig. 2(d)) remain the 
same. 



III. AVERAGING OVER DISORDER 



order of correlation. This also ensures that the concentration 
of the crowding agent does not change, but the internal struc- 
ture may change as happens in the cell. Since we are near to 
percolation threshold, every disorder realization constitute the 
partition function which is non zero. This will permit us to 
study the effect of cellular environment in a much more real- 



Following the method developed in ref. j29ll we denote a 
sample average over various realizations as [...] and thermal 
averaging as (...). 

A. Force-Extension Behavior in CFE 

In CFE, for a given realization of disorder, we calculate the 
reaction coordinate i.e x-component of end-to-end distance 
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FIG. 4: Histogram of extension of various realizations at different forces and fixed temperature (T = 0.3). 



{x % = ^ (2) 

This may be thought as simple representation of molecu- 
lar confinement where disorder sites do not move with time. 
Therefore, we refer it as quenched average of i-th realization. 
The sample average over Eq. (O can be written as 

i 

which we call the sample average over quenched disorder. In 
Fig. 3(a) we show some of the representative plots for differ- 
ent correlated realizations. The qualitative nature of F-x curve 
almost remain same, but vary for different realizations. All 
these plots show the existence of plateaus. It may be noted 
that the number of plateaus may vary from realization to re- 
alization. The sample averaged reaction coordinate has also 
been shown in Fig. 3(b) at low temperature. As pointed above, 
because of large variation in the partition functions of differ- 
ent realizations, reaction coordinate also exhibits the similar 
variations. In Fig. 4, we plot the histogram of reaction coordi- 
nate for different realizations. With rise of force, the distribu- 
tion peak shifts to the right. However, due to the large width 
of the histogram, the plateaus seen in individual realization 

_ . 1 . • 1 _ * _1_ • T"J" _ 1 /1_ \ Til ■ _ 



does not mean the plateau vanishes. In fact all these plateau 
exist for different realization. In Fig. 5a, we plot the probabil- 
ity distribution of plateaus (P(ip)) at T — 0.3. The variation 
of sample average of number of plateaus ([ip]) with tempera- 
ture is shown in Fig. 5b. From these plots, it is evident that 
average number of plateaus decreases with temperature and 
system approaches to two-state at high temperature. 

For molecular crowding, we average over finite but large 
number of realizations. The realization in the limit N to t — ► oo 
is not an experimental requirement. Since, the correlated per- 
colation clusters are having the structures close to the initial 
cluster, therefore, obtaining all possible percolation clusters 
are beyond the scope. Hence we call the averaging over finite 
correlated clusters as an approximate annealed average. It is 
important to mention here that this averaging will vary on the 
choice of initial cluster as well as on the number of realiza- 
tions. Moreover because of long tail in the distribution, two 
different sets of realizations may not give exactly the similar 
F-x curve. However, qualitative nature of the F-x curve for 
two different sets will remain same. The reaction coordinate 
(end-to-end distance) in case of approximate annealed averag- 
ing has been defined as 

, H v ^E < E W ,N)^ ) ^ ) (^,|x|) U ^a,N 

im/D = (4) 

i T X ^ ry(i\ 1 7i t _ ._ _i _ ■ .. 7i t 
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realizations of the disorder. Though in the limit N to t — * oo, 
annealed average and pure give the same result, however, in 
present case this may not be true. Because all the correlated 
percolation clusters are having the structures close to the ini- 
tial cluster and hence all possible conformations of disorder 
are not possible to generate. In Fig. 3b, we also plot the force 
extension curve for the approximate annealed case which also 
shows the existence of plateaus / intermediate states. At 
T = 0.3, we find two plateaus (Fig. 3b) which is close to 
the average number of plateaus {[ip]) obtained by sample av- 
eraged quenched case (Fig. 5b). 



B. Force-Extension Behavior in CDE 

It is known that atomic force microscopy (AFM) experi- 
ments work on CDE. When the disorder realizations were in- 
dependent to each other Singh et.al. j29tl have shown strong 
oscillations in F-x curve in CDE for approximate annealed 
case. Surprisingly such oscillations were absent for pure (in 
absence of disorder) and sample averaged quenched disorder. 
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FIG. 5: (a) Probability of average number of plateaus at fixed tem- 
perature (T = 0.3) for 10000 disorder realizations. Insets show the 
same plot for two different 5000 disorder realizations. All these nor- 
malized plot of probability overlaps with each other; (b) Plot of aver- 
age number of plateaus ([ip]) with temperature for sample averaged 
rmenched case. 
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FIG. 6: Same as Fig. 3, but at CDE for temperature T = 0.3. 



In present case where each realization is correlated, we ob- 
serve strong oscillations (Fig. 6) in both approximate an- 
nealed and sample averaged quenched cases. This may be 
because in CDE the system probes local ground state energy 
which may be quite close for each disorder realization. How- 
ever in case of independent realizations local ground state en- 
ergy will be different for each realization and hence sample 
averaging will smoothen out such oscillations. 



C. Probability Distribution 

In Fig 7 and 8, we show the probability distribution curves 
of reaction coordinate for different forces at fixed tempera- 
ture. This provides important information about the changes 
in the structural processes in cellular environment. In our cal- 
culation we use the following expressions to calculate prob- 
ability distribution for the approximate annealed and sample 
averaged quenched cases: 
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FIG. 7: Plot of P(x) vs x for sample averaged quenched disorder case at different F (fixed temperature T = 0.3). 



For pure case the probability distribution curve has been 
studied by Singh et. al. l29ll . For the correlated disorder we 
find qualitatively similar behaviour at lower and higher forces. 
At low force, the peak in the probability distribution corre- 
sponds to the collapsed state and at high force it corresponds 
to the stretched state. It was found that for the pure case at cer- 
tain force (near the transition point) the peak broadens which 
indicates that the transition is continuous. However, for the 
correlated disorder case (for both approximate annealed and 
sample averaged quenched disorder), one can see the emer- 
gence and disappearance of peaks [Fig 7 and 8] for different 
F at T = 0.3. In case of sample averaged quenched case the 
probability distribution profile is broad in nature with many 
small sharp peaks [Fig. 7]. Since entropy associated with 
sample averaged quenched disorder is much higher, therefore, 
the width of the peak broadens. However, for approximate 
annealed case the width of the peak remains smaller which is 
evident from Fig. 8. With increasing force, we see the height 
of one peak increases while others decreases. We find at many 
different forces the height of two peaks (at different position) 
become equal indicating the existence of two states. Unlike 
pure case, this reflects that disorder may change the order of 
transition and there will be emergence of intermediate states 
which are solely induced by the disorder. 



sented here clearly shows that force induced transition as ob- 
served in vitro is quite different than in vivo. It is evident 
from our model studies that molecular crowding or molecular 
confinement induces intermediate states which may be seen in 
the form of plateaus in F-x curve at CFE. It is interesting to 
note that quenched average over different disorder also show 
the existence of intermediate states similar to approximate an- 
nealed case. Because of the heterogeneity in the number of 
non-bonded nearest neighbor interaction (number may vary 
in between to 2) along the chain and the structure of disor- 
der, we see oscillations in F-x curve in CDE for both approx- 
imate annealed and sample averaged quenched cases which 
were found to be absent for the uncorrelated disorder [291. 



There are many biological events which are rare in nature. 
The occurrence of such event may be because of long-tail be- 
havior in the distribution of partition function. Therefore, at 
this moment of time additional numerical and experimental 
works are required to understand the effect of crowding agents 
on unfolding processes seen in the cell. 
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